Segment-based phonetic class detection using minimum verification error (MVE) training
نویسندگان
چکیده
In this paper, we investigate the performance of segment-based detectors for three taxonomic sets of acoustic-phonetic classes. Acoustic-phonetic detectors form an important processing layer for speech event decoding in the new detection-based automatic speech recognition. In this study, detectors are trained within a minimum verification error (MVE) framework which is markedly different from the conventional maximum likelihood (ML) method. Performance evaluations are conducted upon the TIMIT database by comparing detectors trained via MVE and detectors trained via maximum likelihood. Remarkable improvement in terms of detection error reduction is observed and reported. The result is a solid manifestation of the effectiveness of the discriminative training method, particularly MVE, in the detection-based speech recognition approach. These detectors, aside from being an important processing stage in an overall speech recognition system, can also be extended for applications in diagnostic information retrieval or recognition rescoring for utterance verification.
منابع مشابه
Speaker verification using minimum verification error training
We propose a Minimum Verification Error (MVE) training scenario to design and adapt an HMM-based speaker verification system. By using the discriminative training paradigm, we show that customer and background models can be jointly estimated so that the expected number of verification errors (false accept and false reject) on the training corpus are minimized. An experimental evaluation of a fi...
متن کاملSubword-based minimum verification error (SB-MVE) training for task independent utterance verification
In this paper we formulate a training framework and present a method for task independent utterance verification. Verification-specific HMMs are defined and discriminatively trained using minimum verification error training. Task independence is accomplished by performing the verification on the subword level and training the verification models using a general phonetically balanced database th...
متن کاملTask Independent Speech Verification Using SB-MVE Trained Phone Models
Robust ASR-systems should benefit from detecting when portions of the decoded hypotheses are incorrect. This can be done by including a separate verification module based on statistical hypothesis testing. String based minimum verification error (SB-MVE) training is a promising alternative for improving the corresponding verification-models. This paper adresses a variant of SB-MVE at the phone ...
متن کاملEvolutionary minimum verification error learning of the alternative hypothesis model for LLR-based speaker verification
It is usually difficult to characterize the alternative hypothesis precisely in a log-likelihood ratio (LLR)-based speaker verification system. In a previous work, we proposed using a weighted arithmetic combination (WAC) or a weighted geometric combination (WGC) of the likelihoods of the background models instead of heuristic combinations, such as the arithmetic mean and the geometric mean, to...
متن کاملDesign of Detectors for Automatic Speech Recognition
This thesis presents methods and results for optimizing subword detectors in continuous speech. Speech detectors are useful within areas like detectionbased ASR, pronunciation training, phonetic analysis, word spotting, etc. Firstly, we propose a structure suitable for subword detection. This structure is based on the standard HMM framework, but in each detector the MFCC feature extractor and t...
متن کامل